Instant Recovery with Write-Ahead Logging: Page Repair, System Restart, and Media Restore

Authors

  • Goetz Graefe
  • Wey Guy
  • Caetano Sauer
Abstract

Traditional theory and practice of write-ahead logging and of database recovery techniques revolve around three failure classes: transaction failures resolved by rollback; system failures (typically software faults) resolved by restart with log analysis, “redo,” and “undo” phases; and media failures (typically hardware faults) resolved by restore operations that combine multiple types of backups and log replay. The recent addition of single-page failures and single-page recovery has opened new opportunities far beyond its original aim of immediate, lossless repair of single-page wear-out in novel or traditional storage hardware. In the contexts of system and media failures, efficient single-page recovery enables on-demand incremental “redo” and “undo” as part of system restart or media restore operations. This can give the illusion of practically instantaneous restart and restore: instant restart permits processing new queries and updates seconds after system reboot and instant restore permits resuming queries and updates on empty replacement media as if those were already fully recovered. In addition to these instant recovery techniques, the discussion introduces much faster offline restore operations without slowdown in backup operations and with hardly any slowdown in log archiving operations. The new restore techniques also render differential and incremental backups obsolete, complete backup commands on the database server practically instantly, and even permit taking full backups without imposing any load on the database server.
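
The on-demand incremental “redo” mentioned in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the names `LogRecord`, `Page`, `OnDemandRedo`, and `fetch` are hypothetical, the log is assumed to be indexable by page, and each page is assumed to carry the LSN of the last update it reflects. Only the “redo” half is shown; “undo” of loser transactions is omitted.

```python
from dataclasses import dataclass, field


@dataclass
class LogRecord:
    lsn: int        # log sequence number, increasing over time
    page_id: int    # page the update applies to
    key: str
    value: str


@dataclass
class Page:
    page_id: int
    page_lsn: int = 0                         # LSN of the last update reflected here
    data: dict = field(default_factory=dict)


class OnDemandRedo:
    """Hypothetical buffer manager that repairs a page only when it is first read."""

    def __init__(self, backup_pages, log):
        self.backup = {p.page_id: p for p in backup_pages}
        # Assumption: a per-page index over the log, in LSN order.
        self.log_by_page = {}
        for rec in sorted(log, key=lambda r: r.lsn):
            self.log_by_page.setdefault(rec.page_id, []).append(rec)
        self.buffer = {}                      # pages already recovered

    def fetch(self, page_id):
        if page_id in self.buffer:            # recovered earlier, nothing to do
            return self.buffer[page_id]
        old = self.backup.get(page_id, Page(page_id))
        page = Page(page_id, old.page_lsn, dict(old.data))
        # Replay only this page's log records that are newer than the page image.
        for rec in self.log_by_page.get(page_id, []):
            if rec.lsn > page.page_lsn:
                page.data[rec.key] = rec.value
                page.page_lsn = rec.lsn
        self.buffer[page_id] = page
        return page
```

In this sketch, new transactions could call `fetch` right after reboot or on replacement media; each page pays its own recovery cost on first access, and pages that are never touched are never replayed.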

Similar resources

Instant Restore After a Media Failure

Media failures usually leave database systems unavailable for several hours until recovery is complete, especially in applications with large devices and high transaction volume. Previous work introduced a technique called single-pass restore, which increases restore bandwidth and thus substantially decreases time to repair. Instant restore goes further as it permits read/write access to any da...

A Cost-Effective Method for Providing Improved Data Availability During DBMS Restart Recovery After a Failure

We present a cost-effective method for improving data availability during restart recovery of a data base management system (DBMS) after a failure. The method achieves its objective by enabling the processing of new transactions to begin even before restart recovery is completed by exploiting the Commit_LSN concept. It supports fine-granularity (e.g., record) locking with semantically rich lock ...

An empirical analysis of database recovery costs

The time required for recovery from a failure is heavily influenced by hardware setup and workload characteristics. In bad but still realistic cases, the recovery required during restart can take hours. For a database system based on write-ahead logging, we performed a qualitative study of how hardware and software configurations affect the behavior of the database and, consequently, how this b...

Fast Crash Recovery for a Distributed Column-Store Database Management System

A traditional database management system (DBMS) supports both atomic transactions and crash recovery by maintaining an undo/redo log. The standard write-ahead logging protocol, which requires a system to force the undo and redo log records describing a modified page to stable storage before writing the modified page to disk, ensures that sufficient information exists to perform recovery. After ...

Segment-based recovery: Write ahead logging revisited

Although existing write-ahead logging algorithms scale to conventional database workloads, their communication and synchronization overheads limit their usefulness for modern applications and distributed systems. We revisit write-ahead logging with an eye toward finer-grained concurrency and an increased range of workloads, then remove two core assumptions: that pages are the unit of recovery a...

Publication date: 2014